MARTINGALE OPTIMAL TRANSPORT 
AND 

ROBUST HEDGING IN CONTINUOUS TIME 



YAN DOLINSKY AND H.METE SONER 
HEBREW UNIVERSITY OF JERUSALEM AND ETH ZURICH 



Abstract. The duality between the robust (or equivalcntly, model indepen- 
dent) hedging of path dependent European options and a martingale optimal 
transport problem is proved. The financial market is modeled through a risky 
asset whose price is only assumed to be a continuous function of time. The 
hedging problem is to construct a minimal super-hedging portfolio that con- 
sists of dynamically trading the underlying risky asset and a static position of 
vanilla options which can be exercised at the given, fixed maturity. The dual 
is a Monge-Kantorovich type martingale transport problem of maximizing the 
expected value of the option over all martingale measures that has the given 
marginal at maturity. In addition to duality, a family of simple, piecewise 
constant super-replication portfolios that asymptotically achieve the minimal 
super-replication cost is constructed. 



1. Introduction 

The original transport problem proposed by Monge [18] is to optimally move a 
pile of soil to an excavation. Mathematically, given two measures v and fi of equal 
mass, we look for an optimal bijection of K d which moves v to \i, i.e., look for a 
map S so that 

<f{S{x))dv{x) — I ip(x)dfj,(x), 

for all continuous functions ip. Then, with a given cost function c, the objective is 
to minimize 

/ c(x, S(x)) dv{x) 
Jm d 

over all bijections S. 

In his classical papers [T^l [TO] , Kantorovich relaxed this problem by considering 
a probability measure on R d x M. d , whose marginals agree with v and fj,, instead 
of a bijection. This generalization linearizes the problem. Hence, allows for an 
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easy existence result and enables one to identify its convex dual. Indeed, the dual 
elements are real-valued continuous maps (g, h) of M. d satisfying the constraint 



overall (g, h) satisfying the constraint II) . In the last decades an impressive theory 
has been developed and we refer the reader to [U [23l [24] and to the references 
therein. 

In robust hedging problems, we are also given two measures. Namely, the ini- 
tial and the final distributions of a stock process. We then construct an optimal 
connection. In general, however, the cost functional depends on the whole path 
of this connection and not simply on the final value. Hence, one needs to con- 
sider processes instead of simply the maps S. The probability distribution of this 
process has prescribed marginals at final and initial times. Thus, it is in direct 
analogy with the Kantorovich measure. But, financial considerations restrict the 
process to be a martingale (see Definition 12. 4[) . Interestingly, the dual also has a 
financial interpretation as a robust hedging (super-replication) problem. Indeed, 
the replication constraint is similar to (jl.lj) . These formal connection between the 
original Monge-Kantorovich problem and the financial problem is further discussed 
in Remark 12.71 and also in [3] . 

We continue by describing the robust hedging problem. Consider a financial 
market consisting of one risky asset with a continuous price process. As in the 
classical paper of Hobson [13] , all call options arc traded assets and can be bought 
for a "reasonable" price that is known initially. Hence, the portfolio of an investor 
consists of static positions in the call options in addition to the usual dynamically 
updated risky asset. This leads us to a similar structure as in JT3] and in other 
papers that consider robust hedging of model- independent price bounds. Apart 
from the continuity of the price process no other model assumptions are placed on 
the dynamics of the price process. 

In this market, we prove the Kantorovich duality, (Theorem l2.6[) and an approx- 
imation result, fTheorem 12.8ft , for a general class of path-dependent options. The 
classical duality theorem, for a market with a given semi-martingale, states that the 
minimal super-replication cost of a contingent claim is equal to the supremum of its 
expected value over all martingale measures that are equivalent to a given measure. 
We refer the reader to Delbaen & Schachcrmayer [10] (Theorem 5.7) for the general 
semi-martingale processes and to El-Karoui & Quenez |llj for its dynamic version 
in the diffusion case. Theorem 12.61 below, also provides a dual representation of 
the minimal super-replication cost but for the general model independent markets. 
Indeed, it is given as the supremum of the expectations of the contingent claim over 
all martingale measures with a given marginal at the maturity but with no domi- 
nating measure. Since no probabilistic model is prc-assumed for the price process, 
the class of all martingale measures is quite large. Indeed, typically martingale 
measures are orthogonal to each other and this fact renders the problem difficult. 
Also, the additional feature of the market that the call prices arc known, introduces 
the new marginal constraint. This in turn, provides the connection to the problem 
of optimal transport. 
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In the literature, there are two earlier results in this direction. In a purely discrete 
setup, a similar result was also recently proved by Beiglbock, Henry-Labordere and 
Pcnkner [3] . In their model, the investor is allowed to buy all call options at finitely 
many given maturities and the stock is traded only at these possible maturities. In 
this paper, however, the stock is traded in continuous time together with a static 
position in the calls with one maturity. In [4] the dual is recognized as a Mongc- 
Kantorovich type optimal transport for martingale measures and is essentially used. 

In continuous time, Galichon, Henry-Labordere and Touzi |12j prove a different 
duality and then use the dual to convert the problem to an optimal control problem. 
There are two main differences between our result and the one proved in |12j . The 
duality result, Proposition 2.1 in |12j . states that the minimal super-replication cost 
is given as the infimum over Lagrange multipliers and supremum over martingale 
measures without the final time constraint and the Lagrange multipliers are related 
to the constraint. Also the problem formulation is different. The model in [T^] 
assumes a large class of possible martingale measures for the price process. The 
duality is then proved by extending an earlier unconstrained result proved in |21j . 
As in the unconstrained model of J2TJ [22] , the super- replication is defined not path- 
wise but rather probabilistically through quasi-sure inequalities. Namely, the super- 
replication cost is the minimal initial wealth from which one can super-replicate the 
option with probability one with respect to all measures in a given class. In general, 
these measures are not dominated with one measure. As already mentioned this 
is the main difficulty and differs the current problem from the classical duality 
discussed earlier. However, our duality result together with the results of [12] 
implies that these two approaches - namely, robust hedging through the path-wise 
definition of this paper and the quasi-sure definition of |12[ 121] - yield the same 
value. This is proved in Section [3] below. 

Our second result provides a class of portfolios which are managed on a finite 
number of random times and asymptotically achieve the minimal super-replication 
cost. This result may have practical implications allowing us to numerically inves- 
tigate the corresponding discrete hedges, but we relegate this to a future study. 

The robust hedging has been an active research area over the past decade. The 
initial paper of Hobson [13] studies the case of the lookback option. The connection 
to the Skorokhod embedding is also made in this paper and an explicit solution for 
the minimal super-replication cost is obtained. This approach was further developed 
by Brown, Hobson and Rogers [5] and Cox and Obloj [7] and in several other papers. 
We refer the reader to the excellent survey of Hobson Q3] and the references therein. 
Also a similar modeling approach was applied to volatility options by Carr and 
Lee [6]. We refer to the recent paper by Cox and Wang [8] for more information 
and a discussion of various constructions of the Root's solution of the Skorokhod 
embedding. Also in a recent paper, Davis and Obloj [9] considered the variance 
swaps in a market with finitely many put options. In particular, in [5] the class of 
admissible portfolios is enlarged and numerical evidence is obtained by analyzing 
the S&P500 index options data. 

As already mentioned above, the dual approach is used by Galichon, Henry- 
Labordere and Touzi [12] and Henry-Labordere and Touzi [17] as well. In these 
papers, the duality provides a connection to stochastic optimal control which can 
be then used to compute the solution in a more systematic manner. 
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Our approach is to represent the original robust hedging problem as a limit of 
robust hedging problems which live on a sequence of countable spaces. For these 
type of problems, robust hedging is the same as the classical hedging, under the 
right choice of the probability measure. Thus we can apply the classical duality 
results for super-hedging of European options on a given probability space. The 
last step is to analyze the limit of the obtained prices. We combine methods from 
arbitrage-free pricing and limit theorems for stochastic processes. 

The paper is organized as follows. Main results are formulated in the next 
section. In Section [3J the connection between the quasi sure approach and ours 
is proved. The last two sections are devoted to the proof of one inequality which 
implies the main results. 

2. Preliminaries and main results 

The financial market consists of a savings account which is normalized to unity 
B t = 1 by discounting and of a risky asset St, t <E [0,T], where T < oo is the 
maturity date. Let s := So > be the initial stock price and without loss of 
generality, we set s = 1. Denote by C + [0,T] the set of all strictly positive functions 
/ : [0,T] — > M. + which satisfy fo = 1. We assume that St is a continuous process. 
Then, any element of C + [0,T] can be a possible path for our stock price process 
S. Let us emphasis that this the only assumption that we make on our financial 
market. 

Denote by T>[0, T] the space of all measurable functions v : [0,T] — > M. with the 
norm ||t>|| = sup 0<t<T \v t \. Let G : T>[0, T] — >• M be a given deterministic map. We 
then consider a path dependent European option with the payoff 

(2.1) X = G(S), 

where S is viewed as an element in T>[0, T]. 

2.1. An assumption on the claim. Since our approach is through approxima- 
tion, we need the regularity of the pay-off functional G. A discussion of this assump- 
tion is given in Remark 12.21 In particular, it is related to the classical Skorokhod 
topology, and Asian and lookback type options satisfy the below condition. We 
need the following definition. Let T>n[0,T] be the subset of X>[0,T] that are piece- 
wise constant with N possible jumps to — < t\ < t-x < . . . < ijv _■ T, i.e., 
v e V N [0,T] if and only if 

N 

vt = ^2 v iX[t i -i,t i )( t ) + v N+iX[t N ,T](t), where Vi := v u _ t . 
i=i 

We make the following standing assumption on G. 
Assumption 2.1. There exists a constant L > so that 

\G(u) - G(u>)\ <L\\u)-w\\, w,u>GV[0,T]. 
Moreover, let v,v € T>n[0,T] be such that Vi = Vi for all i = 1, N. Then, 

N 

\G(v)-G(v)\<L\\v\\J2\*t k -At k \, 
fc=i 

where as usual Atfc := — tk—i and Atk := tk — tk—i- 
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Remark 2.2. In our setup, the process S represents the discounted stock price 
and G(S) represents the discounted award. Let r > be the constant interest rate. 
Then, the payoff 

G(S) := e~ rT H ( e rT S T , min e rt S t , max e rt S u [ e rt S t dt] , 

\ 0<t<T 0<t<T J I 

with a Lipschitz continuous function H : K 4 — > M. satisfies the above assumption. 

The above condition on G is, in fact, a Lipschitz assumption with respect to a 
metric very similar to the Skorokhod one. However, it is weaker than to assume 
Lipschitz continuity with respect to the Skorokhod metric. Recall that this classical 
metric is given by 

d(f,g):=M sup max (\f(t) - $(A(t))|, |A(t) - 1\) , 

A 0<t<T 

where the infimum is taken over all time changes. A time change is a strictly 
increasing continuous function which satisfy A(0) = and A(T) = T. We refer 
the reader to Chapter 3 in [3] for more information. In particular, while J Q Sfdt 
is continuous with respect to the Skorokhod metric in C[0,T], it is not Lipschitz 
continuous in C[0, T] and it is not even continuous in 2?[0, T\. Although we assume 
S to be continuous, since in our analysis we need to consider approximations in 
2?[0,T], the above assumption is needed in order to include Asian options. 

Moreover, from our proof of the main results it can be shown that Theorems 
and 12.81 can be extended to payoffs of the form 



H e rtl S tl ,...,e rtk S tl , min e rt S u max e rt S t , / e rt S t dt 

\ 0<t<T 0<t<T J I 

where H is Lipschitz and < t\ < ... < < T. 

Finally, one can extend the results to barrier options, by considering a discretiza- 
tion that is adapted to the barriers. In this paper, we choose not to include this 
extension to avoid more technicalities. □ 



2.2. European Calls. Let fj, be a given probability measure on K + . At time zero, 
the investor is allowed to buy any call option with with strike K > 0, for a price 

C(K) J {x-K) + dfi{x). 

The probability measure fj, is assumed to be derived from observed call prices that 
are liquidly traded in the market. One may also think that fi describes the proba- 
bilistic belief of the firm for the stock price at time T . Then, a simple arbitrage and 
approximation arguments imply that the price of an option with the payoff g(Sr) 
with g € L 1 (R + ,/i) must be given by J gdfi. 

Also, since C(0) is the price of a forward, it must be equal to the initial stock 
price So which is normalized to one. Therefore, although the probability measure 
fj. is quite general, in order to avoid arbitrage, it should satisfy that 

(2.2) C(0) = / xdn(x) = So = 1. 
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Technically, we also assume that there exists p > 1 such that 
(2.3) / x p dfi{x) < co. 



2.3. Admissible portfolios. We continue by describing the continuous time trad- 
ing in the underlying asset S. Since we do not assume any semi-martingale structure 
of the risky asset, this question is nontrivial. We adopt the path- wise approach and 
require that the trading strategy (in the risky asset) is of finite variation. Then, for 
any function h : [0,T] —> M. of finite variation and continuous function S G C[0,T], 
we use integration by parts to define 

t nt 

h u dS u := htSt — hoSo — / S u dh u , 
o Jo 

where the last term in the above right hand side is the standard Stieltjes integral. 

We are now ready to give the definition of semi-static portfolios and super- 
hedging. Recall the exponent p in p. 31) . 



Definition 2.3. 1. We say that a map 

<p : A C V[0,T] -> V[0,T] 
is progressively measurable, if for any v, v G A, 
(2.4) v u = v u , Vue[0,i] => 4>(v) t = <t>(v) t - 

2. A semi-static portfolio is a pair it := (3,7), where g G L 1 (R + ,/i) and 

7 : C+[0,T] -> D[0,T] 

is a progressively measurable map of bounded variation. 

3. The corresponding discounted portfolio value is given by, 

Z?(S) = g(S T )x { t=T} + f\u(S)dS u , t G [0,T], 
Jo 

where xa is the indicator of the set A. A semi-static portfolio is admissible, if there 
exists M > such that 



(2.5) Zf(S)>-M 1+ sup S p u , V<G[0,T], 5gC+[0,T]. 

V 0<u<t / 

4. An admissible semi-static portfolio is called super- replicating, if 

Z^{S)>G{S), VSeC+[0,T}. 

5. The (minimal) super-hedging cost of G is defined by, 

V(G) := inf {/. 9 ^ : 3 7 ««* that n := ( 5 , 7 ) i. super-replicating } 



□ 



Notice that the set of admissible portfolios depend on the exponent p which 
appears in the assumption (|2.3[) . We suppress this possible dependence to simplify 
the exposition. 
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2.4. Martingale optimal transport. Since the dual formula refers to a proba- 
bilistic structure, we need to introduce that structure as well. Set £1 := C + [0,T] 
and let § = (§t)o<t<T be the canonical process given by §t(w) := w-t, for all w£!l. 
Let Tt ■— cr(S s , < s < t) be the canonical filtration. 

The following class of probability measures are central to our results. Recall that 
we have normalized the stock prices to have initial value one. Therefore, below the 
probability measures need to satisfy this condition as well. 

Definition 2.4. A probability measure P on the space (f2, J-*) is a martingale 
measure, if the canonical process (8 t )^ is a local martingale with respect to P and 
§ = 1 P-a.s. 

For a probability measure [i on R + , M M is the set of all martingale measures P 
such that the probability distribution of St under P is equal to fi. □ 

Note that if /i satisfies ()2.2j) . then the canonical process (§t)|L is a martingale 
(not only a local martingale) under any measure P € M M . 

Remark 2.5. Observe that (|2.2|) yields that the set M M is not empty. Indeed, 
consider a complete probability space {fl w , T w , P w ) together with a standard 
one-dimensional Brownian motion (Wt)^ , and the natural filtration which 
is the completion of (rjM^ls < t}. Then, there exists a function / : R — > W.+ such 
that the probability distribution of f(Wr) is equal to fi. Define the martingale 
M t := E w (f{W T )\JT)i t G [0, T}. In view of ((HI, M = 1. Since M is a Brownian 
martingale, it is continuous. Moreover, since /j, has support on the positive real line, 
/ > and consequently, M > 0. Then, the distribution of M on the space VL is an 
element in M^. □ 

The following is the main result of the paper. An outline of its proof is given in 
the subsection 12.61 below. 

Theorem 2.6. Assume that the European claim G satisfies the Assumption \2.1\ and 
the probability measure /i satisfies (|2.2| and (|2.3| . Then, the minimal super-hedging 
cost is given by 

V{G) = sup E P [G(S)] , 
where Ep denotes the expectation with respect to P. 

Remark 2.7. One may consider the maximizer, if exists, of the expression 

sup E P [G(S)] , 

as the optimal transport of the initial probability measure v = Sin to the final 
distribution /i. However, an additional constraint that the connection is a martin- 
gale imposed. This in turn places a restriction on the measures, namely (|2.2p . The 
penalty function c is replaced by a more general functional G. In this context, one 
may also consider general initial distributions v rather than Dirac measures. Then, 
the martingale measures with given marginals corresponds to the Kantorovich gen- 
eralization of the mass transport problem. 

The super-replication problem is also analogous to the Kantorovich dual. How- 
ever, the dual elements reflect the fact that the cost functional depends on the 
whole path of the connection. 

The reader may also consult [4] for a very clear discussion of the connection 
between the robust hedging and the optimal transport. □ 
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2.5. A discrete time approximation. Next we construct a special class of simple 
strategies which achieve asymptotically the super-hedging cost V. 

For a positive integer TV and any S G C + [0, T], set Tq(S) = 0. Then, recursively 
define, 

(2.6) rf \S) = inf (t > rfl(S) : \S t - 5^, (S) | ^}aT, 

where we set t^ N \s) = T, when the above set is empty. Also, define 

(2.7) H (N \S) = min{fc G N : rf°(S) = T}. 

Observe that for any 5 G C+[0, T], H^(S) < oo. 

Denote by A/v the set of all portfolios for which the trading in the stock occurs 
only at the moments = t (JV) (S) < t[ N) (S) < ... < r^l )(s) {S) = T. Formally, 
7r := (5,7) € An, if it is progressively measurable in the sense of (|2.4p and it is of 
the form 

ff (N '(S)-l 

7t(5)= 5^ T fc ^)^(r< N '(5),r<^(S)]( t )' 
fe=0 

for some 7fe(5)'s. Note that, jk(S) can depend on S only through its values up to 
time rj. (S), so that 7 t is progressively measurable. Set 



Vn(G) := inf I J gd[i : EI7 such that 7r := (g, 7) G ^Iat is super-replicating j . 

It is clear that for any integer k > 1, Vjv(G) > Vfe W (G) > 7(G). The following 
result proves the convergence to V(G). This approximation result is the second 
main result of this paper. Also, it is the key analytical step in the proof of duality. 

Theorem 2.8. Under the assumptions of Theorem \2.b\ 

lira V N {G) = V(G). 

Af-s-oo 



2.6. Proofs of Theorems 12.61 and 12.81 In order to establish Theorem 12.61 and 
Theorem 12.81 it is sufficient to prove that 

(2.8) lim sup V N (G) < sup E P [G(S)] 

and 

V(G) > sup E P [G(S)] . 

The first inequality is the difficult one and it will be proved in the last two sections. 
The second inequality is simpler and we provide its proof here. 

Let P G M M and let tt = (5,7) be super- replicating. Since 7 is progressively 
measurable in the sense of (|2.4[) . the stochastic integral 

/ Ju($)d§ u 
Jo 
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is defined with respect to P. Also P is a martingale measure and G satisfies the 
lower bound ()2.5|) . Hence, the above stochastic integral is a P local-martingale. 
Moreover, 

/ Ju(§)d$ u > G(S) > -M(l + sup \S t \ p ). 

JO 0<t<T 

Also in view of (|2.3j) and the Doob-Kolmogorov inequality, 

E P sup |St| p < C p E p \$t\p = C p [ \x\ p dii< 

0<t<T J 



00. 



Therefore, Ep "/ u (S)dE> u < 0. Since ir is super-replicating, this together with 
(|2.5j) implies that 

E P [G(§)\ < Ep [ / 7 „(S)dS t , + ff (Sr) ) < E P [<?(S T )] = / gdfx, 



where in the last equality we again used that the distribution of §t under P equals 
to /i. This completes the proof of the lower bound. Together with (|2.8[) . which will 
be proved in the last sections, it also completes the proofs of the theorems. □ 



3. Quasi sure approach and full duality 

An alternate approach to define robust hedging is to use the notion of quasi 
sure super- hedging as it was done in JT2J [21] . Let us briefly recall this notion. Let 
Q be the set of all martingale measures P on the canonical space C + [0,T] under 
which the canonical process § satisfies So = l,P-a.s., and has quadratic variation 
(see for the precise definition). In this market, an admissible hedging strategy 
(or a portfolio) is defined as a pair it — (5,7), where g £ Li(M + ,/i) and 7 is a 
progressively measurable process such that the stochastic integral 

/ 7 „d§„, te [0,T] 
Jo 

exists for any probability measure in the set P £ Q and satisfies (|2.5|) P-a.s. We 
refer the reader to [21j for a complete characterization of this class. In particular, 
one does not restrict the trading strategies to be of bounded variation. A portfolio 
7r = (g, 7) is called an (admissible) quasi-sure super-hedge, provided that 

g(Sr) + I ludSu > G(S), P a.s., 



for all P € Q. Then, the minimal super-hedging cost is given by 

V qs (G) := inf [J gd^i : 37 such that 7r := (5,7) is a quasi-sure super-hedge 
Clearly, 

V(G) > V qs (G). 
Proposition 2.1 in [17] states that 

Vas(G) = inf supEp ( G(S) - A(S T ) + / Ad/x 
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Since inf sup > supinf, the above two inequalities yield, 
V(G) > V qs (G) 

inf supE P f G(S) - A(S T ) + / \d L i ) 

AeLi(R +lAI ) PeQ V J J 

> sup inf E P ( G(S) - X(St) + [ Xdii ) 

PeC AGLi(M + , M ) V J ) 



Hence, we can restrict the maximization over all P <E M^. We then use Theorem 



Now if P ^ , then the infimum over A in the above expression is minus infinity, 
ice, we can n 
to arrive at 

V(G) > Va S (G) = inf supEp ( G(§) - A(S T ) + / \{x)dLi(x) 

AGL 1 (B + , M )p eQ 



> sup inf Ep G(§) - A(S T ) + / Xdu 

P 6 gAGLi(R +lM ) V 

> sup Ep [G(S)] = V(G). 



Hence, all terms in the above are equal. We summarize this in the following which 
can be seen as the full duality. 

Proposition 3.1. Assume that the European claim G satisfies Assumvtion \2. 1\ and 
the probability measure fx satisfies (|2.2[) . (|2.3[) . Then, 

V(G) = V qs (G) = sup Ep[G(§)] 

inf supEp ( G(S) - A(S T ) + / Xdfi ) 

AeLi(R +lAI )p eQ V J J 

= sup inf Ep ( G(S) - AfSr) + f Xdu 

p eQ AGLi(K+,^) V J 



4. Proof of the main results 
The rest of the paper is devoted to the proof of (|2.8jl . 

4.1. Reduction to bounded claims. We start with a reduction to claims that 
are bounded from above. Towards this result, we first prove a technical result. 
Consider a claim with pay-off 

a K (S) : =(||S|| + l) X{||s||+1 > x} + M. 

Recall that Vn(<xk) is defined in subsection 12.51 
Lemma 4.1. 

limsup limsup Vn(o.k ) = 0. 

K—¥oo N^rOO 

Proof. In this proof, we always assume that N > K > 2. Let tj. = ty (S) and 
n = H (N) (S) be as in p^]> . pTfjl . respectively, and set 

;=6 < £ ) (S) = min {k : S Tk > K - 2} A n. 
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We next define a portfolio (g( N - K \j( N - K y) g An as follows. For t <G (rk,Tk+i] and 
k = 0, 1, set 

7 (»,K )(S) _ T ,»,„ (S) . ( o§&S; -,) ( ,J & S?r .) 

and with c p := p/ (p — 1) define g( N ' K ) by 

9 {N ' K) (x) = 1(1 + (Mf - c p ) + ) + (fox)* - fo (X - 2))*>) + + |. 

We use Proposition 2.1 in [2] and the inequality x < 1 + x p , x S R+, to conclude 
that for any te [0, T] 

g [N ' K) {S t ) + [ J u dS u > — f max 5 U ] + ( max 5„ + 1 ] X{ maXo<u<t s„+i>A'}- 
Jo ^ yO<M<i J \p<u<t J - - 

Therefore, iry N ' K > ■= (g N ' K \ j( N > K )^) satisfies (|2.5[) and super-replicates ax- Hence, 

V N (a K ) < fgWMdn. 

Also, in view of (|2.3|) . 

limsup limsup / g^ N ' K ^dfi = 0. 

A'— >co ]V— >oo J 

These two inequalities completes the proof of the lemma. □ 

A corollary of the above estimate is the following reduction to claims that are 
bounded from above. 



Lemma 4.2. If suffices to prove (|2.8[) for claims G that are bounded from above 
and satisfying the Assumvtion \2. 1\ 

Proof. Suppose (|2.8[) holds for bounded from above claims satisfying the Assump- 
tion 12.11 Let G be a general claim satisfying Assumption 12.11 For a constant K, 
set 

G K :=GAK. 

Since Gk is clearly bounded from above, (|2.8|) holds for Gk- Therefore, 

limsup V N (G K ) < sup E P [G K (S)] < sup E P [G(§)] . 

n^oo pgml rei„ 



In view of Assumption 12. 1[ 

G(S) < G(0) + L\\S\\. 
Hence, the set {G(S) > K} is included in the set {L\\S\\ + G(0) > K} and 

G<G K + (L\\S\\ + G(0)) X{l\\s\\+g(0)>k } . 
By the linearity of the market, this inequality implies that 

V N (G) < V N {G K ) + V N ((L\\S\\ + G(0)) X{l\\s\\+g(o)>k}) ■ 
Moreover, in view of the previous lemma, 

limsup limsup V N ((L\\S\\ + G(0)) X{l\\s\\+g(o)>k}) =0. 

K— >oo TV— >oo 

Using these, we conclude that 

limsup V N (G) < sup E P [G{8>)} , 

N~>od PeML 
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as required. □ 

4.2. A countable class of piecewise constant functions. In this section, we 
provide a piece-wise constant approximation of any continuous function S. Fix a 
positive integer N. For any S G C+[0,T], let rf (S) and H^(S) be the times 
defined in (|2.6[) and (|2.7p . respectively. To simplify the notation, we suppress their 
dependence on S and iV and also set 

(4.1) n = H iN) (S). 

We first define the obvious piecewise constant approximation S = S^- N \S) using 
these times. Indeed, set 



X{T}(t), 



k=0 

where as usual sign(x) = 1 if x > 0, and sign(x) = —1 for x < 0. 

The function, that takes 5 to S 1 is a map of C + [0, T] into the set of all functions 
with values in the target set 

AW={i/ N : i = 0,1,2,.,.,}- 

Indeed, S is behind the definition of the approximating costs Vn- However, this set 
of functions is not countable as the jump times are not restricted to a countable set. 
So, we provide yet another approximation by restricting the jump times as well. 

Let B[0,T] be the space of all right continuous functions / : [0,T] — > R + with 
left-hand limits (cadlag functions). For integers N, k, let 

UW := {i/{2 k N) : i = 1, 2, . . . , } U {l/(i2 k N) : i = 1, 2, . . . , } , 

be the sets of possible differences between two consecutive jump times. Next, we 
define a subset of Cl. 

Definition 4.3. A function / e B[0, T] belongs to B^, if it satisfies the followings, 
1- /(0) = 1, 

2. / is piecewise constant with jumps at times ti, ...,t n , where 

t Q = < ti < t 2 < ... < t n < T, 

3. for any k = l,...,n, \f(t k ) - f{t k -x)\ = 1/N, 

4. for any k = 1, ...,n, t k - t h _ x G U ( k N) . □ 

We emphasize, in the fourth condition, the dependence of the set f/£ on k. So 
as k gets larger, jump times take values in a finer grid. 
We now define an approximation 

:C + [0,T]^BW, 

as follow. Recall r k = T { k N) (S), n = H^(S) from above and also (JM]), $27$. Set 
fo := 0, and for k = 1, n, define 

k 

r k := ^2 At,, 

i=l 

Afj = max{At G f//^ : Ai < An = n - n-x}. 
It is clear that = % < fi < ... < f n < T and < r k for all fc = 0, . . . , n. 
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Wc now define F^(S) by, 

n-l 

(4.2) F t iN) (S) = Y / S TkXl f k ,r k+l) (t) 



k=0 



Sy»_i + —sign(Sr ~ S Tn _ 1 ] 



X[f n ,T](t). 



Observe that F^ N ^(S) and S differs only through the jump times. Indeed the jump 
times of S are r^'s while the jump times of are ffc's. We recall again that 

all these times depend both on N and S. Also, by construction, F^- N ^(S) £ O^). 
But, it may not be progressively measurable as defined in (|2.4p . However, we use 
p( N ) on iy to lift progressively measurable maps defined on H^ N > to the initial space 
il = C + [0,T] and this yields progressively measurable maps on f2. This procedure 
is defined and the measurability is proved in Lemma 14.71 below. 

The following lemma shows that is close to S in the sense of Assumption 

12.11 We also point out that the following result is a consequence of the particular 
structure of 1D>W and in particular a. 

Lemma 4.4. Let F^ be the map defined in (|4.2 
Assumption \2. 1\ with the constant L, 

2>L\\S\ 



For any G satisfying the 



G{S) - G{F^ N \S)) 



< 



N 



V S e C+[0,T] 



Proof. First recall Assumption 12.11 and observe that S and F = F^ N ' are like the 
functions v and v in that definition. Hence, 



G(S)-G(F) <L\\S\\ ^|Ar fe -Af fc | 



k=l 



For k < 



Af fc = max | At € U { k N) : At < Ar k } 



The definition of implies that 
< Ar k - Af fc < 



2 k N' 



l,...,n-l. 



Therefore, 

n oo -. -. 

fc=i fc=i 
Combining the above inequalities, we arrive at 

L\\S\\ 



G(S) - G(F) 



< 



N 



We now directly estimate that 

\G{S)-G(F)\ < G(S)-G(S) 



< L\\S-S\\ 



L\\S\ 
N 



G{S) - G(F) 

_ ms\\ 

N 



□ 
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Remark 4.5. The proof of the above Lemma provides one of the reasons behind 
the particular structure of U^ N \ Indeed, (|4.3p is a key estimate which provides a 
uniform upper bound for the sum of the differences over k. Since there is no upper 
bound on k, the approximating set uj^^ for the k-th difference must depend on k. 
Moreover, it should have a summable structure over k. That explains the terms 2 fc . 

On the other hand, the reason for the part {l/(z2 fe 7V) : i = 1, 2, . . . } in U^' is 
to make sure that Affc > 0. For probabilistic reasons (i.e. adaptability), we want 
Tk < Tfc. This forces us to approximate At^ by Affe from below. This and Affe > 
would be possible only if uj: has a subsequence converging to zero. 

Hence, different sets of f/^'s are also possible provided that they have these 
two properties. □ 

4.3. A countable probabilistic structure. An essential step in the proof of (|2.8|) 
is a duality result for probabilistic problems. We first introduce this structure and 
then relate it to the problem Vn- 

Let Cl := D[0, T] be the space of all right continuous functions / : [0,T] — > M. + 
with left-hand limits (cadlag functions). Denote by S = (§t)o<t<T the canonical 
process on the space O. 

The set defined in Definition 14.31 is a countable subset of O. We choose any 
probability measure pW on Cl which satisfies pW (1D)W) = 1 and P (N) ({f}) > 
for all / 6 B (7V ). Let J" t (W) , i 6 [0,T] by the filtration which is generated by the 
process § and satisfying the usual assumptions (right continuous and contains pW 
null sets). Under the measure P, the canonical map § has finitely many jumps. Let 

= f (S) < n(S) < . . . < f m (S) < T, 

be the jump times of S. Note that in Definition 14. 3[ the final jump time is always 
strictly less than T. 

Then, a trading strateg y on the filtered probability space (fi, P N \ (J ri t N) )'[ =0 , pM) 
is any progressively measurable stochastic process (jt)f =0 . Given the special form 
of the canonical process and the definition of stochastic integrals, there is no loss 
of generality to restrict the trading strategy to be of the following form, 

H(§)-1 

(4.4) <%(§)= 7 fe (§)x (ffc( i),f fc+1 (i) ] W+7 J J (§) (S)x (ff}(§)( i), T] W, 

k=0 

for some % for k = 0, . . . , H(S). 

Moreover, since 7 is progressively measurable, there exists a map 

(j) : D[0,T] -)• T>[0,T], 

so that it is progressively measurable in the sense of (|2.4[) and 7 = <j>(S), P-a.s. Since 
we may always work with (f> instead of 7, there is no loss of generality in assuming 
that 7 is itself is progressively measurable in the sense of (|2.4j) . In particular, if 
S, S G V> [N) satisfy f k (S) = f k {S) and 

(4.5) §„=§„, V«6[0,u(S)], then %(§)=%(§). 
We now give the probabilistic counterpart of the Definition 12.31 
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Definition 4.6. 1. A (probabilistic) semi-static portfolio is a pair (h, 7) such that 
7 : D[0,T] X>[0,T] is given as in flU} satisfying (j43|), and h : A (Ar) R. 
2. A semi-static portfolio is pW -admissible, if /i is bounded and there exists A/ > 
such that 

(4.6) / %d§ u > —M, ¥ N) ~a.s., t e [0,T]. 



3. An admissible semi-static portfolio is pW -super-replicating, if 

(4.7) ft(Sr) + / 7 A > G(S), P W - a.s. 

Jo 

□ 

4.4. Approximating /i. Recall the set An of portfolios used in the definition of 
Vn in subsection 12.51 

Next we provide a connection between the probabilistic super-replication and 
the discrete robust problem. However, h in Definition 14.61 above is defined only on 
A( N 1 while the static part of the hedges in An are functions defined on M.+ . So for 
a given we define the following operator 



3 



(JV) 



by 

5 (Ar) (x) := (1 + [Nx\ - Nx)h([Nx\/N) + (Nx - [Nx\)h((l + [Nx\)/N), 

where for a real number r, [rj is the largest integer that is not larger than r. 
Next, define a measure fj,( N ) on the set A^ by 

r i/N 
M W ({0}) := / 



and for any positive integer fc, 

/•fc/JV f(k+l)/N 

^ N) ({k/N}) := (Nx+l-k)dfJ,(x)+ (1 + k- Nx)dn(x). 

J(k-1)/N Jk/N 

This construction has the following important property. For any bounded function 
h : A^ R, let 5 W = be as above. Then, 

,(N) _ f (N) 



(4.8) / hd^> = J gWdp. 

In particular, by taking h = 1, we conclude that is a probability measure. 

Also, since g^ converges pointwise to g, one may directly show (by Lebesgue's 
dominated convergence theorem) that converges weakly to \i. 

4.5. Probabilistic super-replication. We now introduce the super-replication 
problem by requiring that the inequalities in (|4.7p hold P almost surely. Let G be 
a European claim as before and N be a positive integer. Then, the probabilistic 
super-replication problem is given by, 

V N (G) = inf I J hdpi {N) : 3 7 s.t. (h, 7) is a P {N) admissible super hedge of g\ . 

Recall that in the probabilistic structure, admissibility and related notions are 
defined in Definition 14.61 
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We continue by establishing a connection between the probabilistic super hedging 
and the discrete robust problem. Suppose that we are given a probabilistic semi- 
static portfolio 7r = (<7, 7) in sense of Definition 14.61 We lift this portfolio to a 
semi-static portfolio = (g^ N \"/^ N ^) in the sense of Definition 12.31 Indeed, let 
be as in subsection gU and set 7W : C+[0, T] -> D[0, T] by 



7i 



(S) = E^( fW (S))xh,r H1 ]W, 



k=0 
IN) 



where 7* is as in (JO), T fe = '(S) be as in pT6]l . n be as in f|47T|) and F {N) {S), 
Tk '■= Tk(S) be as in (|4.2[) . Notice that n = H( N )(S), the number of jumps of S, is 
equal, by construction, to the number of jumps of F^ N ', 

Lemma 4.7. For any probabilistic semi-static portfolio (5,7), 7^ defined above 
is progressively measurable in the sense of (|2.4[) . 

Proof. Let S, S € C+[0,T] be such that S" u = S„ for all u < t for some t G [0,T]. 
We need to show that 

Since the above clearly holds for t = and t = T, we may assume that i E (0, T). 
Set 

jfc t (S) := := inf{i > 1, : rf > t } - 1, 

so that < fc f (5) < n and 

i e r W (N) ] 

By the definition of {r^'j's, fc t (S) = fc t (S). We now use the definition of ffc to 
conclude that 

:= ^fc t (S) = ^ t (S)(5) = T kt{ g } (S). 

Hence, 

Moreover, the definition of F^ N > implies that 

Fi N HS)=Fi N \S), Vue[O,0}. 
Therefore, by the progressive measurability of 7, f|4. 5[) . ^ N \s) — 7J (S). 

□ 

The following lemma provides a natural connection between the probabilistic 
super-replication and the discrete robust problem. 

Recall the set .4^ of portfolios used in the definition of Vn in subsection 12.51 

Lemma 4.8. Suppose G is bounded from above and satisfies the Assumvtion \2. 1\ 
Then, 

limsupVjv(G) < limsup Vjv(G). 



Proof Set 



We first show that 



GW(5):=G(5) 41,11511 



iV 



V N (G^)) < V N {G). 
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To prove the above inequality, suppose that a portfolio (h, 7) is a pW -admissible 
super hedge of G. Then, it suffices to construct a map : C + [0,T] — > T>[0, T] 
such that the semi-static portfolio 77'^ :~ (g^ N \ 7^) is admisible, belongs to Aw 
and super- replicates G^ N ' in the sense of Definition 12.31 

Let g( N > be as in subsection !4.4l and 7 W be the probabilistic portfolio considered 
in Lemma T4. 71 We claim that ttW is the desired portfolio. In view of Lemma 14771 
we need to show that 77^ is in An an d super-replicates the in the sense of 

Definition [2731 

To simplify the notation, we set F := F^ N '(S). 

Admissibility of . 

By construction trading is only at the random times Tfe's. Therefore, G An 
provided that it satisfies the lower bound (|2.5[) for every t G [0, T]. Fix such a time 
point t G [0,T]. Since F G B (Ar) , in view of flU]) , there exist M so that 



/ %(F)dF u > —M, Vt G [0,T], fe 
J[o,t] 



We also observe that by construction 

for every fc satisfying < t. However, the above integrals on (r^, i) may not agree. 
So for a given r, G [0, T] and 5 G C + [0, T], let fc be the largest integer so that Tk < t. 
Construct a function F G (depending on t, S, but the dependence suppressed 

in the notation) by, 

F[a,f k+1 ) = F[o,f k+1 ), (i.e., F U =F U , V it G [0,f fc+ i),) 

and 

where we set f n +i = T . Note that the constructed function F depends on S and 
N, since both F and the stopping times Tt depend on them. But we suppress these 
dependences. Since 

F* k+1 - p r k = ~ [Ff k+1 - F fk ] = ±l/N, 

and since 

\S t -S Tk \<l/N, 
there exists A G [0, 1] (depending on t) such that 

St - S Tk = X(F fk+1 -F fk ) + (1- X)(F fk+1 - F fk ). 

Since F and F agree on [0,f fc+1 ), and 7 is progressively measurable (c.f. ([4.510 . 
%(F) = %(F) for all u < f k+1 . Therefore, 

C ^ N) (S)dS u = X [ %{F)dF u + (1 — A) f iu{F)dF u . 

Also both F, F G ID) (Ar) , and F(F),P(F) > 0. Using fl4l| we may conclude that 
f %{F)dF u > —M, and f %{F)dF u > —M, 

J[0,f k+1 ] J[0,T k+1 ] 

Hence, rr (Ar) G An- 
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Super-replication. 
We need to show that 



{N) (S T )+ / ^ u N HS)dS u >G^(S). 



We proceed almost exactly as in the proof of admissibility. Again we define a 
modification F £ by frof n _ 1 ) = -Fjo,f„_i) an d F u = -Ff„_i for u > f n _i. Set 

A := AT|S T - ^r„_J- 

Then A £ [0, 1] and by the construction of g^ N \ 

5 W(S T ) = Aft(F r ) + (1 - X)h(F T ). 

Hence, 



ff (A,) (5r) + / 



A 



fr(*r) + / %{F)dF u 



+ (1 - A) 



M-Fr) + / %{F)dF u 



> XG(F) + (1 - A)G(F). 
Since \\F — F\\ < 1/N, Assumption 12.11 and Lemma T4.4I imply that 

4LIISI 



N 



G(S)-G(F) > \G(S) - G(F)\ + G(F) - G(F) - 
Consequently, 

\G(F) + (1 - X)G(F) > G {N) {S) 
and we conclude that n^- N ' is super-replication G^ N \ 

Completion of the proof. 
We have shown that 

V N (G-4L\\S\\/N) < V N (G). 

Moreover, the linearity of the market yields that super-replication cost is sub- 
additive. Hence, 

V N (G) < V N (4L\\S\\/N) + V N (G - 4L\\S\\/N). 
V N (G) < V N (4L\\S\\/N) + V N (G). 



Therefore, 
Finally, by Lemma |4. 11 



limsup V N (4L\\S\\/N) = 0. 

N— s-oo 



We use the above inequalities to complete the proof of the lemma. 



□ 
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4.6. First duality. Recall the countable set C tl and its probabilistic struc- 
ture were introduced in subsection l4.3l Wc consider two classes of measures on this 
set. 

Definition 4.9. 1. We say that a probability measures Q on the space is 
a martingale measure, if the canonical process (E>t)J =0 is a local martingale with 
respect to Q. 

2. Mjv is the set of all martingale measures that are supported on O^. 

3. For a given K > 0, M.^' is the set of all measures Q £ Mn that satisfy 

K 



(4.9) |q = k / N ) - ^ [N) (W N }) 



k=0 



□ 



The following follows from known duality results. Also, we will combine it with 
and Proposition 15.11 that will be proved in the next section to complete the 
proof of the inequality (|2. 



Lemma 4.10. Suppose that G bounded from above and satisfies the Assumption 
\2.1l Then, there is K > so that for any positive integer N , 

Vn(G) < sup E Q |g(S) 

Proof. Fix N and set 

(4.10) k := sup E Q |g(S) 

Q6MW L 

If k < 0, we may consider G — k instead G. Hence there is a no loss of generality 
in assuming that k > 0. Let K be an upper bound for G and define the set 

Z = ZW := {h : A™ -> R : \h(x)\ < N, V.t}. 

Set 

V:=inf sup [E Q (G(S) - h(S T )) + [ hd^ N A . 
Clearly, for any e > 0, there exists h £ Z such that 

sup E Q (G(S) - h{S T )) + [ hd^ N) < V + e. 

By construction, the support of the measure P is W- N ^ . Also all elements of 
are piece-wise constant. Thus, P almost surely, the canonical process § is trivially 
a semi- martingale. Hence, we may use the results of the seminal paper jlOj . In 
particular, by Theorem 5.7 in |10j . there exists an admissible portfolio strategy 7 

such that for x > sup,g, eMiv Eq (g(S) — h(^x)j , we have 

x + / %dS u > G(S) - h{S T ), P (N) a.s. 



Thus, (h + x,j) satisfies (|4T5 |) -(|4T7 ) . and V N (G) < V + e. Wc now let e to zero to 
conclude that 

(4.11) V N (G) < inf sup (E Q (G(S) - h(S T )) + f hd^ N] 
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The next step is to interchange the order of the above infimum and supremum. 
Consider the vector space R^ <JV) of all functions / : — s- R cquipcd with the 
topology of point-wise convergence. Clearly, this space is locally convex. Also, since 
A( N 1 is countable, Z is a compact subset of R" 4 ' . The set Mat can be naturally 
considered as a convex subspace of the vector space R d(JY) . 

Now, define the function Q : Z x M N -> R, by 



G(h,Q) = Eq (g(§) - h(Sr)) + f hd^ N) 



Notice that Q is affine in each of the variables. From the bounded convergence 
theorem, it follows that Q is continuous in the first variable. Next, we apply the 
min-max theorem, Theorem 2, in [4] to Q. The result is, 

inf sup g(h,Q)= sup inf Q(h,Q). 



This together with pTTTj) yields, 

(4.12) V N (G)< sup inf (Eq(G(S) - /i(S T )) + ! hd^ N) J . 

Finally, for any measure Q £ Mjv, define fft € Z by 

/i Q (fc/iV) = Assign (<TJ (s t = k/N) - ({k/N})) , k = 0, 1, . . . . 
In view of (j4~T2]l . 

Vat(G) < sup (Eq(G(S))+ [h®dn (N) -E Q /i Q (S T ) J 

= sup J E Q (G(B)) - JV ^ |(Q) (St = fc/iv) - ^ ({fc/JV}) 



Suppose that Q £ M^°. Then, 

OO 

|Q (S T = k/N) - /i (Ar) ({k/N}) > K. 

k=0 

Since G is bounded by K , this implies that 

oo 

E Q (G(B)) - N^ |Q (s t = fc/JV) - ({W) 

fe=0 

However, since we assume that k defined in (|4.10[) is non-negative, the upper bound 
is then proved. So we may assume that Q £ Mjy . Then, 

{oo 
E Q (G(B)) -N V |(Q (St = k/N) - /iW ({k/N}) 
k=o 1 V ' 



< 0. 



< sup E Q (G(§)). 



□ 
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5. Approximation of Martingale Measures 

In this final section, we prove the asymptotic connection between the approx- 
imating martingale measures M^-* defined in Definition 14.91 and the continuous 
martingale measures M M satisfying the marginal constraint at the final time, de- 
fined in Definition 12.41 

The following proposition completes the proof of the inequality (|2.8j) and conse- 
quently the proofs of the main theorems. 

Proposition 5.1. Suppose that G is bounded from above and satisfies the Assump- 
tion \2.1[ Assume that /i satisfies (|2.2[) - (|2.3[) . Let K be as in Lemma \4-. 10\ Then, 

limsup sup E Q G(S) < sup E P [(?($)] . 



We prove this result not through a compactness argument as one may expect. 
Instead, we show that any given measure Q G has a lifted version in that 

is close to Q in some sense. Indeed, the above proposition is a direct consequence 
of the below lemma. 

Recall the Lipschitz constant L in Assumption 12. II 

Lemma 5.2. Under the hypothesis of Proposition I5.il there exists a continuous 
function fx with /if (0) = 0, so that for any Q G M^y and e > 0, 



G(§) < ^ + M e ) + SU P e p I G ( § )] 



Proof. Fix e G (0, 1) , a positive integer N and Q G M^y . As in the proof of Lemma 



15 . H we assume without loss of generality that G is bounded from above by K. 

Jump times. Since the probability measure Q is supported on the set M^ N \ the 
the canonical process § is a purely jump process under Q, with a finite number of 
jumps. Introduce the jump times by setting (To — and for k > 0, 

ok = inf{* > a k -i :B t ^S t -}AT. 

Next we introduce the largest random time 

N := minjfc : = T}. 

Then, N < oo almost surely and consequently, there exists deterministic positive 
integer to (depending on e) such that 

(5.1) Q(A> > m) < e. 

By the definition of the set B( w ), there is a decreasing sequence of strictly positive 
numbers tk i 0, with t\ = T, such that for i = 1, to, 

<n - o-i_i g {<4r=i u Q - a - s - 



Wiener space. Let (fl w ,J rW , P w ) be a complete probability space together with 
a standard m+2-dimensional Brownian motion jwj = {w^\ W^ m+2 ^J j , 

and the natural filtration = o\W s \s < t}. The next step is to construct a 
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martingale Z on the Brownian probability space (Q w , T w , P w ) together with a se- 
quence of stopping times (with respect to the Brownian filtration) n < T2 < ... < T m 
such that the distribution (under the Wiener measure P w ) of the random vec- 
tor (n, ...,T m , Z T1 , Z Tm ) will be equal to the distribution of the random vector 
(<7i, cr m , S CT1 , §<r m ) under the measure Q. Namely, 

(5.2) ((n, ...,r m , Z T1 , Z Tm ), P 147 ) = ((ai, ...,cr m ,S ffl , ...,§ CTm ), 

The construction will be done by induction, at each step we will construct the 
stopping time r k and Z rfc such that the conditional probability will be the same as 
in the case of the canonical process S under the measure Q. 

Construction ofr's and Z. For an integer n and given x\, ... ,x n , introduce the 
notation 

•En ■ — {%1 ) • • • , •En ) ■ 

Also set 

For k = l,...,m, define the functions : T fe x { — 1, l}''" 1 -> [0,1] by 

(5.3) * fe (a fc ; := Q (o* - tr k -i > a k \ A) , 
where 

A := [a. t - ai-i = on, § CT! - § <xt _ 1 = ft/iV, i < ft - l} , 

and 

(5.4) $ fe (a fe ; /3 fc _!) = Q (s fffc - = 1/N \ B ) , 
where 



B 



|cr fc < T, a 3 - (Tj-i = ctj, S CTj - Sct^j = A/iV, j < k,i < k — lj 



As usual we set Q(-|0) = 0. Next, for k < to, we define the maps r^,Ofc : T fc x 
{ — 1, l} k ~ 1 — > [—oo, oo], as the unique solutions of the following equations, 

(5.5) P w (wg> < T k {a k -J k -x)) = $ fc (cfc;&_i), 

and 

>Ww« w(!) ^cw^.fl ^ _ *fe(5fc-i,*/;^fe-i) 



(5.6) P w (<» - < + \ < e fc (S*;&_i)) 



*k(ak-i,*J+i;$fc-i) 

where I £ N is given by = i; € T. From the definitions it follows that 
*A,(3fc-i,f l ;&-i) < *fe(5 fc _ lj( i, +1 ;4-i)- Thus if # fc (a fc _i, i m ; /? fc -i) = for 
some I, then also ^k(&k-i,ti', Pk-i) = 0. We set 0/0 = 0. 

Set t = and define the random variables Ti, r TO , Yi, K m by the following 
recursive relations 

oo oo 

(5.7) n = E**x {w w_ w ci) >0i(tfe)} J] X {w w_ w w <Ql ( t3 .) } . 

fc=l fc fc+1 3=fc+l 3 3 + 
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and for i > 1 

n = n-i + A t 

Yi = X{n<T} f 2 x {w c* +I )_ w (i+i) >ri(zrTii?t _ i)} - 1 

where Aj = t k on the set A, n _B; & PI Cj & and zero otherwise. These sets are given 
by, 

Ai := {\Yi^\>0}, 

oo 

Ci,k ■= D {<+r 1 _ I -< 1 i+r j _ I <6 i (A , r i _ 1 ,i j ;ti)}. 

Since t k is decreasing with = T, n < T2 < ... < r m and they are stopping 
times with respect to the Brownian filtration. Let k < m and (a^; G T fe x 
{-1, l} fe_1 . There exists m G N such that a fc = i m G T. From (|5T7)) - (j5~l)) . the 
strong Markov property and the independency of the Brownian motion increments 
it follows that 

(5.8) P w {T k ~T k _ 1 > a k \(AT k ^;Y k ^) = {a k ^ k -x)) 

= P W ( f| (w t % k -> -< + r t _r <e fc (a,_ 1) ^;A- 1 ; 

OO ✓ 

= n pW Wk-x-<Jx-H 1k -x<®*(S*-l,t*;ft-l) 
= * fe (a fc ,Afe_l), 

where the last equality follows from (|5.6p and the fact that 
lim * fc (ai, ...,afc_i,^,/3i, = 1. 

Similarly, from (|5.5[) and (|5.8[> . we have 

life = l|Tfc <-. 

pW 



(5.9) P w [Y k = l|r fe < T, At fc = a fc ,y fe _i = /3 fe _^ 

(wg^-wgg^ir,^;^,)) 

= $ fe (a fe ;/3 A; _i). 
Using (jO]) - (j5"l| and ([5T5 ]) -([0 ]) . we conclude that 

((? m -±Y m ),P w } = ((a m ;A*S m ),Q) 

where A8fc = § CTfc — S r7k _ 1 , k < m. 
Continuous martingale. Set 

(5.10) z t = -E w (£Y i \?Y), te[o,T\. 
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Since all Brownian martingales are continuous, so is Z . Moreover, Brownian motion 
increments are independent and therefore, 

1 k 

(5.11) Z Tk = —^Y l: P w a.a., k<m. 

i=l 

By the construction of Y and r's, we conclude that (|5.2[) holds with the process Z. 



Measure in M M . The next step in the proof is to modify the martingale Z in 
such way that the distribution of the modified martingale will be an element of M„. 
For any two probability measures v-y ,v 2 on R the Prokhorov's metric is defined by 

<*(!/!, i*) = inf{5 > : vi(A) < v 2 {A s ) + S and v 2 {A) < vi{A s ) + 5, MA e S(R)}, 

where £>(R) is the set of all Borel sets A C R and A 5 := {J x£A {x — 5,x + 5) is the 
5 neighborhood of A. It is well known that convergence in the Prokhorov metric 
is equivalent to weak convergence, (for more details on the Prokhorov's metric see 
[TO) . Chapter 3, Section 7). 

Let v\ and v 2 , be the distributions of S<7 m and §t respectively, under the measure 
Q. In view of (fSTTj) . d(v x ,v 2 ) < e. Moreover, (j4~9]) implies that d{v 2 ,^ N ^) < § 
and /j,( N } converges to [i weakly. Hence, the preceding inequalities, together with 
this convergence yield that for all sufficiently large N, d(vi,fi) < 2e. Finally, we 
observe that in view of ([ST2"]l. (Z T , P w ) = v\- 

We now use Theorem 4 on page 358 in [TH] and Theorem 1 in to con- 
struct a measurable function ip : R 2 — > R such that the random variable A := 
4>{Z T , w4 m+2) ) satisfies 

(5.12) (A,P w )= t i and P W (\A - Z T \ > 2e|) < 2e. 
We define a martingale by, 

r t = E W (A\ F™), te [0,T]. 
In view of f|5.12[) . the distribution of the martingale T is an element in M M . Hence, 

(5.13) sup E P [G(S)] > i? M/ (G(r)). 

We continue with the estimate that connects the distribution of F to Q G . 
Observe that E w A = E w Z T = 1. This together with (|5TT2|) . positivity of Z and 
A, and the Holder inequality yields 

(5.14) E W \A-Z T \ = 2E W {A - Z T ) + ~ E w (A - Z T ) 

= 2E W {A-Z T ) + 

< 4,e + 2E w (A X{l A-z T \>2e}) 

< 4e + 2(J x p d f i(x)) 1 ^(2e) 1 / q , 

where p > 1 is as (|2.3|) and q = p/p — 1. 

We now introduce a stochastic process {Zt) t=0 , on the Brownian probability 
space, by, Z t = Z Tk for t e [rfe,Tfc+i), k <m and for i G [r m ,T], we set Zf = Z Tm . 
On the space {(l,PA^tf t=() ^ {N) ) let S< = § tACTm , * e [0,T]. Recall that G is 
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bounded by K. We now use the Assumption (|2.1|) together with (|5.ip and (|5.1ip 
to arrive at 

(5.15) |Eq(G(S)) - E Q (G(S))| < Ke 

\E W (G(Z)) - E W (G(Z))\ < LE W \\Z - Z\\ < ^. 

Recall that by <j52|) . (Z,P W ) = (S,Q). Thus, = E Q (£?(§)). This to- 

gether with (|5. 15|) yields 

(5.16) Eq(G(§)) <^+Ke+ E W {G{Z)). 

From Assumption 12. 1[ the Doob inequality, (|5.13l) - (|5.14[) and f|5 . we obtain 
sup E P G(8) > E W (G(T)) 

P£M„ 



> E W {G{Z)) - LeV* - E w (K X{M>el/2q} ) 



> E Q (G(S))-^-/ JC ( e ) 



where 



f K (e)=Ke + Le 1 ^ + K 



: + 2(J a*d//(a:)) 1/p (26) 1 /? 



,l/2g 

□ 
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